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Abstract: This short note is the result of a French “Hippocampe internship”!^ that aims at 
introducing the world of research to young undergraduate French students. The problem studied 
is the following: imagine yourself locked in a cage barred with n different locks. You are given a 
keyring with N > n keys containing the n keys that open the locks. In average, how many trials 
are required to open all locks and get out? The article studies 3 different strategies and compare 
them. Implementation of the strategies are also proposed as illustrations of the theoretical results. 

1 Statement of the problem 

The precise statement of the problem and the various assumptions are the following: 

Statement of the problem: We need to open n locks with a keyring containing N > n keys. 
We are interested in determining the average and the standard deviation of the number of trials 
to open all locks. The special case when the number of keys eguals the number of locks (n = N) is 
included. 

Assumptions: The following assumptions about locks and keys are made: 

• All locks and keys are assumed to be indistinguishable. 

• All locks are different and require a specihc key to be opened. 

• A lock can always be opened, i.e. the key required to open it belongs to the keyring. 

• A key cannot open more than one lock. 

Consequently, the keyring contains the n different keys corresponding to the locks as well as 
additional keys that do not open anything when N > n. The keys are randomly placed in the 
keyring. We denote by tv fhe number of trials to open all locks. We are interested in gathering 
as much information as possible on Tn,N especially E{Tn^N) and Var(T„^Tv)- 

2 Strategy 1 : The totally random strategy 

The hrst strategy one can think of is to select randomly one of the remaining keys and try it ran¬ 
domly into one the remaining locks to open. In particular we do not keep track about the previous 
tries except for the fact that we discard locks that are already opened and their corresponding 

^ “Hippocampe” (sea-horse or hippocampus in French) internships are three days sessions during which small 
groups of students specialized in mathematics and physics are presented with a problem that they have to solve 
under the supervision of an academic professor. 
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keys. Let us introduce Xi {1 < i < n) the number of trials required between the opening of i 
lock and the lock, (for i = 1, Xi is just the number of trials required to open the first lock). 
Thus, we have: 

n 

T^,n = J2x, ( 2 . 1 ) 

i=l 

In this strategy, the variables Xi are obviously independent since we do not keep track of the 
history of the trials. To open the lock, we choose randomly into the N — i + 1 remaining keys. 
Therefore we get: 


Vfc > 1 : P{Xi = k) 



N-i 
N-i + 1 


1 

N-i + 1 


k N-i 1 

\N-i + l) N-i + 1 


( 2 . 2 ) 


Thus we observe that Xj is distributed as a geometric variable with parameters g* = This 

is of course coherent with the fact that the strategy is memory-less. It is then straightforward to 
obtain the mean and standard deviation of Xp 


E{Xi) = X - i + 1 and Var(Xi) = (X - i){N - i -M) 

Since the variables Xj are independent, we get from ( 2.1[ ) that: 

vL n(2N — n+l) n(3N'^ — 3Nn +— 1) 

EiTn^M) = = --- - and Var(T„,Ar) = ^ Var(Xj) = --- 


(2.3) 


( 2 . 4 ) 


2 = 1 


2=1 


In the case when X = n, we get: 


£)(r„,„) = and Var(r„,„) _ " Dl" + D 


(2.5) 


2 ' ’ ' 3 

Simulations (See show that T„ is very well approached by a T distribution with parameters 
T ^/c = ; d = ^^^^( 2 jv-n+i) ‘ Parameters have been adjusted here so that expec¬ 

tation and standard deviation match (we recall that E{V{k,6)) = kO et Var{V{k,6)) = kO"^). 
Appearance of the T distribution is not surprising here. Indeed, geometric variables are a discrete 
version of the exponential distribution (typical of memory-less phenomenon) that is a special case 
of T distributions {k = 1). Additionally, the sum of independent T distributions with parameters 

However note that 


{ki,6) is known to remain a T distribution with parameters ( k = '^ki,6 

V i=l 2 

our parameters 9i are different so that the last theorem cannot be applied properly but only in an 
approximate manner. 

Of course, the totally random strategy is not the optimal strategy since we do not keep track 
of the combinations tried previously. These strategies are developed in the next sections. 


3 Strategy 2 : Take a lock and test all keys 

Strategy: The strategy is now the following: we choose a lock and then we try all remaining (that 
is to say all keys except those that have already opened a lock) keys successively until we find the 
(unique) right key that opens the lock. Then we continue with the next lock by starting over with 
the first remaining key and so on until all are opened. 
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Remark : Since a key opens at most one lock, we can discard the successful keys. Moreover 
when a lock is opened, all remaining keys (including those that have been tried and those that 
have not been tried on the lock) remain equivalent for the next locks. Indeed, since we now that 
a lock is exactly opened by one key, we get that when a lock is opened, all keys that have not 
been tried immediately acquire the property that they do not open this lock, a property that was 
already known for the keys that have been tried on the lock. In particular the way we will order 
the keys for the next lock does not affect the distribution of the number of trials. 


As in the previous section, let Xj {1 < i < n) be the number of trials required between the 
opening of i — lock and the lock, (for z = 1, Xi is just the number of trials required to 
open the first lock). Equation (2.1) still holds. Moreover from the previous remark, we get that 
the variables Xj are still independent. To open the lock, only X + 1 — z keys remain and we 
get for 1 < fc < X + 1 — z: 


P{Xi = k) 


N — i X — z — 1 
X + 1 — z X — z 


N + l-i-k 1 _ 1 

N+ 2 — i — kN + 1 — i — k X + 1 — z 


(3.1) 


Consequently we observe that Xj follows a discrete uniform distribution: 


Xj ~ Uniform ({1,..., X — z + 1}) 


(3.2) 


In the end we get: 


r./.. 3 X + 2-Z (X + 2-z)(X-z) 

E(Xi) = - - -and Var(Xj) ^- — - 


(3.3) 


and eventually 


^ n(2X + 3-n) ^ ^ n{6N^ - 6Nn + 6N + 2n‘^ - 3n - 5) 

E{Tn,N) = -^- and Var(T„,Ar) =- — - (3.4) 


In the case when X = zz we get: 


E(Tn,n) 


n{n + 3) 
4 


and Yw:{Tn,n) 


n{2n + 5)(n — 1) 
72 


(3.5) 


Simple computations shows that this strategy is in average better than the totally-random one 
even in the case when N = n. Simulations (See section of this strategy are also in agreement 
with the theoretical results. 


4 Strategy 3 : Take a key and try it in all locks 


Strategy: We now consider the following strategy: we take a key and we try it into all remaining 
locks. We stop when it opens a lock or when we have exhausted all remaining locks. We then take 
the next key and start over with the first remaining lock and so on until all locks are opened. 


Let us denote as before Xj (1 < z < zz) the number of trials required between the opening 
of z — lock and the lock, (for z = 1, Xi is just the number of trials required to open 
the first lock) so that equation ( |2.1 ) still holds. In this situation, the variables Xj are no longer 
independent. Indeed, let us assume that to open the first lock, we have used all X — zz useless keys 
(keys that do not open any locks), then it is now impossible that X 2 is greater than zz — 1 since 
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we only have n — 1 keys and n — 1 locks remaining. On the contrary, if the first key opens a lock 
(that is Xi < n), then X 2 may be strictly greater than n if we choose a useless key. Hence we see 
that the range of X 2 depends on the result of Xi, thus Xi and X 2 cannot be independent. The 
case N = n appears to be singular, since in that case the variables Xj remain independent (since a 
key must open a lock). Determining the distribution of Xj is more challenging too. Indeed Xp = k 
depends on the number of keys that have been discarded until the opening of the p — 1*^ lock. 
More specifically, if I locks have been opened before the trial of the key, then the key opening 
lock p must be in position k + l in the keyring. Consequently we get: 


N 

We want to prove the following: 

Proposition 4.1 The following identity holds: 


p-l fk+l-l\ /N-k-l\ 

= ^) = 4 E - ' 


1=0 


(E) 


(4.1) 


p-l (k+l-l\ (N-k-l\ 
1 ^ V I ) \p-l-l) 


N 


1=0 


c;) 


N — p + 1 


, \/k<N—p + 1 


(4.2) 


Proof : This identity is a consequence of the Chu-Vandermonde identitjj^ 


\/J <K <N : 


X + 1 
K + l 


K 

E 

m=0 


m\ f N — m 


J \K-J 


(4.3) 


Then replacing X by X — 1 and using the symmetry of the binomial coefficients we get to: 

K 


VJ < X < X- 1 : 

Taking K = N — p and J = k — 1 we find 

/ X 

\/k<N — p+1 : ( 

Vp- 1 


X 

N-K-1 


E 

m=0 


m\ / X — 1 — m 


J 


m 


K-J 


N — 1 — m 


(4.4) 


p-\-k—2 

\k — 1 )\N — p — k + 1 

ra=k—\ ^ ^ 


(4.5) 


where we restrict the sum so that only non-zero binomial coefficients are present. Performing the 
change I = m — k + 1 gives: 


p-l 

E 

1=0 


k + l — 1 
k-1 


N-k-l 
N — k — p + 1 


X 


X 


p —1/ X—p-|-l\p—1 


X- 1 


, Vfc<X —p-|-l (4.6) 


This last equality is then equivalent to proposition 4.1 


From proposition 4.1 we conclude that the variable Xp is distributed like a discrete uniform 
distribution: 

Xi~Uniform({l,...,X + l-f}) (4.7) 


^The Chu-Vandermonde identity can be proved easily by identification of the order in the binomial ex¬ 

pansion of the identity (1 -|- x)^+™ = (1 + a:)^(l + x)"' 


4 










Expectation and standard deviation are then straightforward to compute: 


E{X,) = and Var(X,) 


{N + 2 - i){N - i) 
12 


From (2.1) we hnally get: 


i=l 


(iV + 2-z) 
2 


n{2N — n + 3) 
4 


(4.8) 


(4.9) 


We recover the same result as in the previous strategy (3.4). In particular Tn^N is still the sum of 
uniform variables. However in this approach, it is unclear to see why the variables Xi should be 
uncorrelated. Consequently the computation of the variance is not directly possible except for the 
case N = n where the variable X^ are independent and (3.5) is recovered. 


The last result indicates that the last two strategies seem similar since their results are identical. 
However computations in the last strategy seem much more complicated than for the previous one 
and fails to explain why the strategy are equivalent. In fact, the last two strategies are exactly 
equivalent no matter the order of keys selected: they will provide exactly the same 
number of trials for any initial conditions. We will prove this result in the next section using 
a combinatorial one-to-one correspondence. 


5 Equivalence of strategies 2 and 3 

Let us order the locks from 1 to n and number the keys that open a lock with their corresponding 
lock number. The useless keys are then numbered randomly from n-|-l to iV (the number itself is of 
no interest, the only piece of information that we need is that it does not open anything). Then an 
initial condition corresponds to the choice of a random permutation of {1,..., V} of the keys. More 
specifically, the randomness of the problem consists in drawing a random permutation of {1,..., iV} 
uniformly on the symmetric group S'at (with V! possibilities). As soon as the permutation is given 
(i.e. the order of the keys given) then the number of trials is completely determined using one of 
the strategies developed in sections or This defines properly the probability space on which 
we work on. Let us illustrate the situation on the following example: 



— Order defined bv strategy 
o Position of the trial 


Key 1 Key 2 Key 3 Key 4 Key 5 Key 6 Key 7 Key 8 Key 9 

Fig. 1.- Illustration of the strategy developed in section\^when locks 1 up to 5 are opened 
respectively by keys in position 3, 5, 2, 9, 7 in the keyring. 
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Strategy: Tiy a key in all locks 
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Key 1 Key 2 Key 3 Key 4 Key 5 Key 6 Key 7 Key 8 Key 9 
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Order defloed by strategy' 
PositioD of the trial 


Fig. 2: Illustration of the strategy developed in section^when locks 1 up to 5 are opened 
respectively by keys in position 3, 5,2, 9, 7 in the keyring. Boxes ticked in red indicate the position 

of the keys in the keyring. 


For both strategies, it is easy to prove that the only boxes tried in the previous strategies are 
those being on the left and at the top of a ticked box. Indeed: 

1. In strategy developed in section boxes at the right of a ticked box will be discarded because 
we do not try the remaining keys as soon as a lock is open. Boxed below a ticked box will 
not be tried because as soon as a lock is opened, we do not test any other key on it. 

2. In strategy developed in section]^ boxes at the right of a ticked box will be discarded because 
as soon as a lock is opened we do not test any other key on it. Boxes below a ticked box will 
not be tried because as soon as a key has opened a lock, we do not try it on any other lock 
(since we know from the assumptions that it can at most open one lock). 

Hence hgures 1 and 2 show that for any initial condition, the number of trials T„ jv will be the 
same for both strategies: only the order of the various tries will differ in the two strategies. 


6 Numeric simulations 

Numeric simulations can be carried out to illustrate the various strategies. We present here the 
code in Maple as well as histograms of the distributions of T„, 7 v for different values of n and N. 
All strategies require a procedure to generate a random permutation: 

RandomPerm:=proc(n) 
local T: 

T:=[seq(l..n)]: 
return(Shuffle(T)): 
end proc: 

Then, the totally random strategy can be implemented like: 

RandomStrategy:=proc(NumberKeys,NumberLocks) 

local NumberTrials, LocksRemaining, KeysRemaining, LockChosen, KeyChosen: 

NumberTrials:=0: 

LocksRemaining:=NumberLocks: 
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KeysRemaining;=NumberKeys: 
while LocksReniaiiiing>0 do 
LockChosen:=rand(LocksRemaining)0+1: 

KeyChoseii:=rand (KeysRemaining) 0+1: 

if LockChosen=KeyChosen then LocksRemaining:=LocksRemaining-l: 

KeysRemaining:=KeysRemaining-1: fi: 

NumberTrials:=NumberTrials+l: 
od: 

return(NumberTrials): 
end proc: 

The strategy developed in section]^ can be implemented like: 

Strategy2:=proc(T,N) 

local Res,KeyInProgress,LocksRemaining,n,j,Tab: 

Tab:=T: n:=nops(Tab): Res:=0: 

KeyInProgress:=1: LocksRemaining:=N: 

while LocksRemaining>0 do 

if Tab[KeyInProgress]>LocksRemaining then 

Res:=Res+ LocksRemaining: KeyInProgress:=KeyInProgress+l: 
else Res:=Res+ Tab [KeyInProgress]: LocksRemaining:=LocksRemaining-l: 
if KeyInProgress<n then for j from (KeyInProgress+1) to n do 
if (Tab[j]>Tab[KeyInProgress]) then Tab[j]:=Tab[j]-1: fi: od: fi: 

KeyInProgress:=KeyInProgress+l: 
fi: od: return(Res): 
end proc: 

A similar code can be implemented for strategy developed in section Results of the simula¬ 
tions and comparison with the theoretical results are: 
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Fig. 3; Illustration of the totally random strategy in the case n = N = S. Curve in red represents 
the r distribution with the theoretical parameters. Curve in black represents the T distribution 

with parameters inferred from the simulation 
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Fig. 4; Illustration of the totally random strategy in the case n = 5 and N = 10. Curve in red 
represents the T distribution with the theoretical parameters. Curve in black represents the T 
distribution with parameters inferred from the simulation 


9 






























































X 

Fig. 5; Illustration of the strategy developed in section\^in the case N = 20 and n = 10. Curve 
in red represents the normal distribution with the theoretical parameters. Curve in black 
represents the normal distribution with parameters inferred from the simulation 
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Fig. 6: Illustration of the strategy developed in section\^ in the case n = N = 20. Curve in red 
represents the normal distribution with the theoretical parameters. Curve in black represents the 
normal distribution with parameters inferred from the simulation 


One can obviously observe that the theoretical results are in perfect agreement with the simu¬ 
lations. Contrary to the totally random case, T„,Ar is better approximated with a classical normal 
distribution in the other two strategies at least when n is large enough. This is quite logical since 
it seems reasonable that a Central Limit Theorem may be applied for equation (2.1). 


7 Conclusion and Outlooks 

Though the problem can be easily stated and understood by common people (after inquiries, this 
was one of the points that seduced the students to choose this problem among others), its resolution 
presents various levels of mathematics from elementary ones (small values of n and N, totally 
random strategy) to more involved ones (Chu-Vandermonde identity). The problem is also well 
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suited for simulations and implementation even for students with basic notions of programming. 
Basic statistical skills are also required to represent the results of simulations. Retrospectively, in 
addition to the Chu-Vandermonde identity and its connection to the problem, the hardest part for 
students (even with help) was to unravel the equivalence presented in hgure 2. 

The problem raises numerous questions and possible developments: 


• What happens if a key is allowed to open more than one lock? Does it affect the order in 
which we start over when a lock is opened? What happens if some locks are allowed to be 
the same? 


• For low n and V, it seems possible to get the complete probability distribution of 

• Simulations in the totally random case indicates that the distribution of Tn,N should be close 
to a r distribution and arguments where presented to explain this observation. Can these 
arguments be precised and a complete proof given? 


When n is large, simulations shows that the distribution of T„, 7 v approaches a normal dis¬ 
tribution. Equation (2.1) provides a perfect setting to apply some Central Limit Theorem. 
Can we have a detailed proof of the application of the CLT and a bound relatively to the 
error committed in the approximation? 


• Illustration provided by hgure 2 shows that the problem can be recast into a combinatorial 
problem of counting top-left boxes after selecting some at random (imposing one per line). 
Hence it would not be surprising that our problem is equivalent to another one in a totally 
different setting in relation with random permutations. 


• In this article we presented the more natural strategies that come to mind, but are the 
strategies presented here really the optimal strategies to minimize in average T^^at? 

• In real life, changing locks may be more time-consuming than changing keys if the locks are 
located far apart. Conversely, the keyring might be hard to handle so that changing from 
a key to the next one may be complicated. Hence one could introduce different times when 
changing keys or locks. Are the results presented here globally unchanged? 
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